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From their inception, Y chromosomes 
in plants and animals are subjected to 
the powerful effects of Miiller's ratchet, a 
process spurred by suppression of recom- 
bination that results in a rapid accu- 
mulation of mutations and repetitive 
elements. These mutations eventually 
lead to gene loss and degeneration of the 
Y chromosome. Y chromosomes in mam- 
mals are ancient, whereas most sex chro- 
mosomes in plants and many in insects 
and fish evolved recently. Sex type in 
papaya is controlled by a pair of nascent 
sex chromosomes that evolved around 7 
million years ago. The papaya X and Y"* 
were recently sequenced, providing valu- 
able insight into the early stages of sex 
chromosome evolution. Here we discuss 
the fruits of this work with a focus on 
the repeat accumulation, gene trafficking 
and promiscuous DNA sequences found 
in the slowly degenerating Y*" chromo- 
some of papaya. 

Papaya as a Model 
for Sex Chromosome Evolution 

Sex chromosomes are widespread in plants 
and animals, and they have evolved inde- 
pendently numerous times. ''^ But despite 
their independent origins, sex chromo- 
somes are all shaped by a common set 
of forces and share a number of features. 
Sex chromosomes evolve from autosomes 
and arise through suppression of recom- 
bination around sex determination loci. 
The lack of recombination fixes the genes 
that control sex, fostering separate male 
and female individuals. Once the recom- 
bination ceased, the sex chromosomes 
diverge, creating distinct chromosome 



types. In humans, males are heterogamet- 
ics with XY chromosomes, whereas in 
birds, females are heterogametic with ZW 
chromosomes. 

Without recombination, purifying 
selection is relaxed in Y and W chro- 
mosomes, causing a slow accumula- 
tion of deleterious mutations through 
a process known as Miiller's ratchet.' 
Recombination removes harmful muta- 
tions, but in the case of heteromorphic 
sex chromosomes, there is no homologous 
chromosome to recombine with, and this 
process is eliminated. Furthermore, selec- 
tion of favorable alleles can "hitchhike" 
along mildly deleterious mutations.'' The 
accumulation of deleterious mutations 
eventually causes the Y and W chromo- 
somes to degenerate, rapidly losing genes 
and gaining repeat sequences. Y chromo- 
somes from plants and animals vary in 
age and are at different stages of evolu- 
tion. Mammalian Y chromosomes are the 
most ancient, evolving around 166 million 
years ago.^ The human Y chromosome 
is extensively degraded and three times 
smaller than the X, retaining only 78 
genes in the male specific region.'' In con- 
trast, the young Y chromosome of white 
campion, Silene latifolia, evolved about 10 
million years ago and has since ballooned 
to 570 Mb in size; 150 Mb larger than its 
X counterpart.^'** The sex chromosomes in 
papaya are young with limited gene loss 
and expansion, serving as a model for the 
early stages of sex chromosome evolution. 

Sex determination in papaya [Carica 
papaya L.) is controlled by a pair of 
nascent sex chromosomes, differentiated 
by an 8.1 Mbp, recombinationally sup- 
pressed, hermaphrodite-specific region 
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Figure 1. Sequence comparison of the HSY and X chromosomes of papaya. Major DNA components were classified into exons (blue) and retrotranspo- 
sons (cyan) and plotted using a sliding window of 50 kb and a shift of 1 0 kb. The gray portion represents unclassified DNA content. Heatmaps showing 
relative abundance of exons, retrotransposon and chloroplast fragments were plotted from low (blue), medium (yellow), high (red) ranges. Paired 
genes in the X and HSY are connected using blue lines and clearly show the two large scale inversions and the collinear region. 



on the Y chromosome (HSY).' There are 
two sHghtly differem Y chromosomes in 
papaya: Y controlling male and Y** con- 
trolling hermaphrodite sex. Any com- 
bination of the Y and Y*" chromosomes, 
YY, Y Y'' and Y'> Y\ are lethal resuking in 
genotypes of XY, X Y'^ and XX for males, 
hermaphrodites and females respec- 
tively. The X and Y*" chromosome (here 
on referred to as HSY) of papaya were 
recently sequenced and annotated using 
a tedious BAG by BAG approach.'"'" 
Together, we sequenced a total of 68 over- 
lapping BAGs for the 8.1 Mb HSY and 
43 BAGs for the 3.5 Mb X. The papaya 
X and HSY represent the second pair 
of sex chromosomes to be completely 
sequenced, and have uncovered a number 
of findings in the early stages of sex chro- 
mosome evolution. 



Retrotransposon Accumulation 
and Novel Sex Specific Repeats 

Repeat accumulation is common in Y 
chromosomes from animals,'"'" and 
has recently been documented in sev- 
eral plants, including hemp {Cannabis 
stativa);^'' liverwort (Marchantia polymor- 
pha);^'' white campion {Silene latifolia)-^^^ 
and papaya.'^ The overabundance of 
repeat sequences in sex chromosomes is 
due to suppressed recombination and the 
lack of a mechanism to remove them. 

The papaya HSY is largely repetitive, 
and the accumulation of repeats accounts 
for much of its expansion in relation to the 
X. 79.3% of the HSY sequence is com- 
posed of repeat sequences, compared with 
67.2% of the X.'° The papaya genome is 
about 51% repetitive,'^ suggesting that 



repeat accumulation has occurred exten- 
sively in both the X and HSY. A closer look 
at the repeats indicates that most are retro- 
elements with an abundance of Tyl-copia 
and Ty3-gypsy retrotransposons (Fig. 1). 
Ty3-gypsey elements are twice as abundant 
in the X and HSY as in the autosome, 
with Tyl-copia elements showing a simi- 
lar but less remarkable expansion. DNA 
transposons are found in both the X and 
HSY, but at numbers comparable to the 
autosome. The origin and classification of 
many transposable elements in the papaya 
HSY is unknown, as repeated integration 
and shuffling has fragmented many of the 
repeats. In addition to the transposable 
elements, we identified 157 pairs of direct 
and inverted duplications that represent 
6% of the HSY. This is remarkably higher 
than the X which contains 8 pairs of 
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duplications which make up a mere 0.5% 
of the total sequence.'" 

Two sequence blocks account for much 
of the expansion seen in the HSY. These 
blocks are mostly composed of repeats, 
with 84.6% and 87.2% of the sequences 
identified as repetitive. Ty3-gypsy ele- 
ments are the most abundant repeat in 
these regions, followed by other retroele- 
ments, inverted duplications, microsatel- 
lites and unclassified sex specific repeats. 
Not surprisingly, the expansion blocks are 
extremely gene poor and correspond to the 
heterochromatic knob structures found 
on the HSY. The expanded regions have 
no homology to the X, suggesting they 
arose after divergence. Free from the selec- 
tive pressures enforced by recombination, 
these regions gradually expanded, accu- 
mulating more repeats. 

We identified 21 sex specific repeats 
(20 in HSY and 1 in X) which are absent 
from the autosome and have no homology 
to other plant repeats. Sex specific repeats 
in the HSY are mostly localized to the het- 
erochromatic regions and together repre- 
sent 10.7% of the total sequence. X specific 
repeats are more dispersed and constitute 
3.5% of the sequence. The origin of the 
sex specific repeats in papaya is unknown, 
but this phenomenon is common in plant 
sex chromosomes. Hemp contains male 
specific LINE-like retrotransposons, and 
in liverwort, sex specific repeats harbor 
male specific genes.'*'" 

Gene Loss and Trafficking 
in the HSY 

The Y chromosome in humans is an 
ancient relic of the ancestral autocome 
from which the X and Y have diverged; 
it is largely degenerated, having lost the 
vast majority of its original gene content 
throughout the course of its evolution. 
The papaya sex chromosomes are much 
younger, and most genes are conserved 
between the X and HSY. 106 protein cod- 
ing genes were identified in the X and 
HSY, including 50 paired genes found 
in both chromosomes as well as 22 HSY- 
specific and 34 X specific genes. One 
quarter of the genes found in the HSY are 
pseudogenes, compared with 14% in the 
X. The high percentage of pseudogenes 
in the HSY are likely a result of Miillers 



ratchet, and the inability to select against 
slightly deleterious mutations. The HSY 
has lost at least one essential gene to the 
forces of Miillers ratchet, as the Y'' Y'' 
genotype is lethal, and embryos abort 
25-50 d after pollination." Gene density 
in the HSY is seven times lower than the 
genome-wide-average with one gene per 
112.5 kb (Fig. 1). The low gene density 
is the consequence of accumulation of 
repetitive sequences and degeneration of 
its gene content. 

We used the 50 paired genes to survey 
the syneny relationship between the X and 
HSY. Three distinct regions are evident, 
including two large scale inversions and 
a collinear region (Fig. 1). Genes in the 
two large inversions form distinct evolu- 
tionary strata which indicate that the first 
inversion occurred about five million years 
before the second, which occurred about 
1.9 million years ago. The first inversion 
likely contains the sex determination loci, 
and the lack of recombination in this 
inverted region spurred the development 
of sex chromosomes. The second inversion 
expanded the size of the HSY. Genes are 
conserved and collinear in the third region 
and sequence differences gradually fade to 
zero as the sex chromosome transitions to 
the pseudoautosomal region. 

Ten of the HSY specific genes we iden- 
tified have copies elsewhere in the genome, 
including seven protein colding genes 
and three pseudogenes, and are likely the 
result of translocation from the autosome. 
Translocated genes are the product of two 
similar mechanisms: transposons captur- 
ing portions of neighboring genes through 
transduplication^" or intron-less mRNAs 
that are reverse transcribed and inte- 
grated into a new chromosomal location 
via retrotransposition.^' Most transposed 
genes are pseudogenes containing a few 
exons from the autosomal copy, but the 
rest are free to mutate and acquire novel 
functions. Newly acquired genes that are 
absent from the X may play a role in sex 
determination, or sex specific traits such 
as flower and fruit morphology. Most of 
the autosome derived genes in the HSY are 
less than 300 bp and display no homol- 
ogy to known proteins, with a few nota- 
ble exceptions. CpYhl9 contains a partial 
MADS box domain sequence and may be 
involved in flower development. PCpYhl 



has homology to HB22 from Arabidopsis, 
a Zinc finger homeodomain protein 
involved in embryo development and 
seed dormancy.^^ Translocated genes with 
known functions also include an ATP syn- 
thase subunit and photosystem II protein 
Dl. In S. latifolia, a gene involved in floral 
development ^/^K was translocated from 
the autosome to the Y chromosome, and 
may be involved in sex determination.^' A 
similar phenomenon may be occurring in 
papaya, several of these translocated genes 
on the HSY could play a role in sexual 
reproduction. 

Chloroplast DNA Accumulation 
in the Absence of Recombination 

Organelle derived sequences are abundant 
in plant nuclear genomes, and organelle 
to nucleus transfers are common. The 
Arabidopsis genome, for instance, con- 
tains a 620 kb fragment of mitochondria 
DNA, and the rice genome contains a 
113 kb fragment of chloroplast DNA.-**'-^ 
The papaya genome contains a number 
of insertions totaling 785,000 bp of chlo- 
roplast and 858,190 bp of mitochondria 
DNA, representing 0.28% and 0.31% of 
the genome respectively.'* Most nuclear 
chloroplast DNAs are less than 1 kb in 
length, as insertions are quickly shuffled 
and eliminated, with an estimated 80% 
of insertions lost within a million years of 
integration.^'' 

In the absence of recombination, the 
papaya HSY has accumulated a staggering 
amount of chloroplast DNA (Fig. 1). The 
HSY has 195 insertions totaling 93, 824bp, 
representing 1.15% of the total sequence. 
Chloroplast insertions are four times 
higher than the genome average and 12 
times higher than the X chromosome. 
Insertions range in size from 50-4,500 
bp including 20 insertions larger than 1 
kb. In the autosome, these large fragments 
would have been shuffled and eventually 
lost due to recombination, but the lack of 
recombination in the HSY has instead, 
fixed them. 

Perhaps the most interesting feature 
of the accumulated chloroplast DNA is a 
fragment of the chloroplast genome con- 
taining the rspl5 gene scattered through- 
out the HSY 23 times. These rspl5 
fragments are nearly identical to each other 
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but divergent from the chloroplast with 
estimated insertion times ranging from 
20—25 milHon years ago, well before the 
inception of sex chromosomes in papaya. 
It is unhkely that the rspl5 fragments 
arose from independent insertion events; 
they were probably transposed across the 
HSY tagging along with active retrotrans- 
posons. Indeed, intact retrotransposons 
are found in close proximity to most rspl5 
fragments, supporting this hypothesis. No 
copies of the rspl5 fragment are found 
elsewhere in the papaya genome or in the 
X chromosome. 

Interestingly, only 10% of the chloro- 
plast fragments are shared between the X 
and HSY. Most fragments either integrated 
after sex chromosome inception, or were 
eliminated from one of the sexes. The lack 
of recombination and prevalence of large 
fragments in the HSY suggests elimina- 
tion isn't responsible. Unlike chloroplast 
insertions, mitochondria genome inser- 
tions are less abundant in the HSY than 
the rest of the papaya genome, encompass- 
ing only 16,458 bp or 0.1% of the total 
sequence. Mitochondria insertions are 
short with an average size of 310 bp and 
only two are larger than 1 kb. Chloroplast 
and mitochondria integrations are equally 
prevalent in the papaya autosome, raising 
the question of why chloroplast integra- 
tions are more frequent in the HSY. 

Organelle DNA accumulation isn't 
limited to the papaya HSY; the Y chro- 
mosome of humans has been shown to 
preferentially accumulate mitochondrial 
DNA^* and the Y chromosome of S. lati- 
folia is accumulating chloroplast DNA.^' 
In contrast to the papaya HSY, the Y chro- 
mosome in liverwort contains hundreds 
of mitochondria derived sequences, but 
only 3 plastid insertions.^" The accumula- 
tion of organelle DNA is a result of sup- 
pressed recombination, but a mechanism 
explaining the preference of mitochon- 
drial or chloroplast sequences is currently 
unknown. Sequencing additional plant 
sex chromosomes may shed light on this 
phenomenon. 

Future Prospects 
of Sex Chromosome Evolution 

The study of sex chromosome evolution 
is often hindered by a lack of complete 



sequence information. Until recently, only 
the Y chromosomes from humans, rhesus 
monkey, chimpanzee and liverwort, and 
the chicken Z were available for compara- 
tive analysis. Sequencing sex chromosomes 
is difficult, as the heteromorphic chromo- 
somes are highly repetitive, contain dupli- 
cated regions and have areas of homology 
between the chromosome types, making 
whole genome shotgun sequencing not 
suitable for assembling sex chromosomes. 
This is further complicated by the lack of 
informative genetic markers for sequence 
anchoring and orientation due to the 
suppressed recombination in the Y or Z 
chromosome. Even with current next gen- 
eration sequencing technologies, the best 
approach for sequencing sex chromosomes 
is a laborious and tedious BAG by BAG 
approach. 

The absence of recombination wreaks 
havoc on the gene content and struc- 
ture in heteromorphic sex chromosomes. 
Ancient Y chromosomes like those found 
in humans and other mammals are highly 
degenerate, gene poor and are drastically 
shrunken in size compared with their 
corresponding X chromosomes. It's diffi- 
cult to trace the events that shaped these 
ancient Y chromosomes or the early stages 
of their evolution. The papaya X and Y'' 
are only 7 million years old and provide 
detailed information of what occurs early 
on in sex chromosome evolution in this 
sex chromosome system. 

Despite its young age, the papaya 
HSY is already showing signs of degen- 
eration with changes in gene content and 
repeat expansion. Retrotransposons have 
expanded the HSY to almost three times 
the size of the X and have translocated 10 
genes from the autosome in the process. In 
addition to the Ty3-gyspy retrotransposons 
and Tyl-copia elements, we identified over 
20 HSY specific repeats which together, 
represent 10% of the total sequence. 
Furthermore, a fragment of the chloro- 
plast genome has translocated 23 times 
in the HSY, evidence of highly active ret- 
rotransposons. Ghloroplast integrations 
are far more common in the HSY than the 
rest of the genome, reflecting the function 
of recombination to shuffle and eliminate 
promiscuous DNA. 

Sex chromosomes in plants display a 
range of evolutionary stages including 



incipient, early stage and late stage. To 
understand the underlying processes 
controlling sex chromosome evolution, it 
is necessary to study organisms with sex 
chromosomes at each of these stages. The 
sequenced sex chromosomes in papaya 
shed light on these processes, but sequenc- 
ing sex chromosomes at additional stages 
is needed for a more complete picture of 
how sex chromosomes evolve. 
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